Close

%0 Conference Proceedings
%4 sid.inpe.br/sibgrapi/2018/10.24.00.46
%2 sid.inpe.br/sibgrapi/2018/10.24.00.46.42
%T The usage of U-Net for pre-processing document images
%D 2018
%A Kubo, Diandra Akemi,
%A Nazare, Tiago Santana de,
%A Aguirre, Priscila Louise Ribeiro,
%A Oliveira, Bruno Domingues,
%A Duarte, Felipe Simões Lage Gomes,
%@affiliation Data Science Team - Itau Unibanco
%@affiliation Data Science Team - Itau Unibanco
%@affiliation Data Science Team - Itau Unibanco
%@affiliation Data Science Team - Itau Unibanco
%@affiliation Data Science Team - Itau Unibanco
%E Ross, Arun,
%E Gastal, Eduardo S. L.,
%E Jorge, Joaquim A.,
%E Queiroz, Ricardo L. de,
%E Minetto, Rodrigo,
%E Sarkar, Sudeep,
%E Papa, João Paulo,
%E Oliveira, Manuel M.,
%E Arbeláez, Pablo,
%E Mery, Domingo,
%E Oliveira, Maria Cristina Ferreira de,
%E Spina, Thiago Vallin,
%E Mendes, Caroline Mazetto,
%E Costa, Henrique Sérgio Gutierrez,
%E Mejail, Marta Estela,
%E Geus, Klaus de,
%E Scheer, Sergio,
%B Conference on Graphics, Patterns and Images, 31 (SIBGRAPI)
%C Foz do Iguaçu, PR, Brazil
%8 29 Oct.-1 Nov. 2018
%I Sociedade Brasileira de Computação
%J Porto Alegre
%S Proceedings
%K #deep-learning #computer-vision #image-processing.
%X When processing documents in real-world scenarios, it is common to deal with artifacts that may hamper document analysis, such as stamps, noise and strange backgrounds. Aiming to mitigate these problems, we propose the use of U-Net, a very successful biomedical image segmentation network, for handwritten and machine text segmentation. In order to do so, we trained a model for each type of text. One of the main advantages presented is that the models are trained on artificial data, avoiding the wearisome task of data labeling. For the machine text segmentation model, we test its impacts on both word and character recognition when combined with the Tesseract OCR model. For the handwritten segmentation model, we present qualitative results. Initial experiments indicate that both models are able to improve results in their respective applications.
%@language en
%3 sibgrapi_pi_cv.pdf


Close